
<120> Novel Human alpha 1 Chain Collagen 



<130> 32350-176844 

<140> US 09/996,611 

<141> 2001-11-30 

<150> TAIWAN 89128027 

<151> 2000-12-27 

<160> 5 

<170> Microsoft Word 

<210> 1 

<211> 954 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> alpha 1 chain collagen 

Pro in this sequence stands for hydroxyproline 

<400> 1 



Met 


Ala 


His 


Tyr 


He 


Thr 


Phe 


Leu 


Cys 


Met 


Val 


Leu 


Val 


Leu 


Leu 


1 






5 










10 










15 


Leu 


Gin 


Asn 


Ser 


Val 


Leu 


Ala 


Glu 


Asp 


Gly 


Glu 


Val 


Arg 


Ser 


Ser 










20 










25 










30 


Cys 


Arg 


Thr 


Ala 


Pro 


Thr 


Asp 


Leu 


Val 


Phe 


He 


Leu 


Asp 


Gly 


Ser 






35 










40 










45 


Tyr 


Ser 


Val 


Gly 


Pro 


Glu 


Asn 


Phe 


Glu 


He 


Val 


Lys 


Lys 


Trp 


Leu 






50 










55 










60 


Val 


Asn 


He 


Thr 


Lys 


Asn 


Phe 


Asp 


He 


Gly 


Pro 


Lys 


Phe 


He 


Gin 










65 










70 










75 


Val 


Gly Val 


Val 


Gin 


Tyr 


Ser 


Asp 


Tyr 


Pro 


Val 


Leu 


Glu 


He 


Pro 










80 










85 










90 


Leu 


Gly 


Ser 


Tyr 


Asp 


Ser 


Gly 


Glu 


His 


Leu 


Thr 


Ala 


Ala 


Val 


Glu 








95 










100 










105 


Ser 


He 


Leu 


Tyr 


Leu Gly 


Gly Asn 


Thr 


Lys 


Thr 


Gly 


Lys 


Ala 


He 










110 










115 










120 


Gin 


Phe 


Ala 


Leu 


Asp 


Tyr 


Leu 


Phe 


Ala 


Lys 


Ser 


Ser 


Arg 


Phe 


Leu 










125 










130 










135 


Thr 


Lys 


He 


Ala 


Val 


Val 


Leu 


Thr 


Asp 


Gly 


Lys 


Ser 


Gin Asp Asp 








140 










145 










150 


Val 


Lys 


Asp 


Ala 


Ala 


Gin 


Ala 


Ala 


Arg 


Asp 


Ser 


Lys 


He 


Thr 


Leu 
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SEQUENCE LISTING 









155 










160 








165 


Phe 


Ala 


lie 


(jiy vai 




Cot* 




Thr* 


Glu Asp 


Ala 


Glu 


Leu Arg 








170 










175 








180 


Ala 


He 


Ala 


Asn Lys 


Pro 


Cqv 


Coy* 


1 ii-L 


Tyr 


Val 


Phe 


Tyr 


Val Glu 






185 










190 








195 


Asp 


Tyr 


lie 


Aia ±±e 


Co y~ 

OCX. 


T .wq 
i_i y o 


Tie 

J. X C 


Arg 


Glu 


Val 


Met 


Lys 

J 


Gin Lys 




200 










205 








210 


Leu 


Cys 


bill 




V d-L 


V— y is 


prn 


Thr 


Arg 


He 


Pro 


Val 


Ala Ala 






215 










220 








225 


Arg 


Asp 


Glu 


Arg oiy 


DV-io 
.rile 


nop 


Tip 
lie 


Leu Leu Gly 


Leu 


Asp 


Val Asn 




230 










235 








240 


Lys 


Lys 


Val 


Lys Lys 


Arg 


I le 


tain 


Leu 


Ser 


Pro 


j 1 y 0 


Lys 


He Lys 




245 










250 








255 


Gly Tyr 


Glu 


Val Tnr 


Ser 


Lys 


1 

Val 


Asp 


Leu 


Ser 


Glu 


Leu 


Thr Ser 








260 










265 








270 


Asn 


Val 


Phe 


Pro Glu 


Gly 


T o l i 


Pro 


Pro 


Ser 


Tyr 


Val 


Phe 


Val Ser 








275 








280 








285 


Thr 


Gin Arg 


Phe Lys 


Val 


Liy fa 


Liyo 


He 


Trp 


Asp 


Leu 


Trp 


Arg He 








290 










295 








300 


Leu 


Thr 


He Asp Gly Arg 


Pro 


Pi y, 

bin 


He 


Ala 


Val 


Tin t~ 


Leu Asn Gly 








305 










310 








Jib 


Val 


Asp 


Lys 


He Leu 


Leu 


DVio 


1 il-L 


Thr 


Thr 


Ser 


Val 


He 






320 










325 








"5 "3 n 

33 U 


Ser 


Gin 


Val 


Val Thr 


Phe 


/\-L 3L 




Pro 


Gin 


Val 


Lys 


Thr 


Leu Phe 




335 










340 








345 


Asp 


Glu 


Gly 


Trp His 


Gin 


lie 


Aircj 


Leu 


Leu 


Val 


Thr" 


Glu 


Gin Asp 




3 50 










355 








360 


Val 


Thr 


Leu 


Tyr He 


Asp 






Gin 


He 


Glu 


Asn 


Lys 


Pro Leu 








365 










370 








375 


His 


Pro 


Val 


Leu Gly 


He 


T Ol 1 


Tip 

X -L C 


Asn Gly Gin 


Thr 


Gin 


He Gly 








380 










385 








390 


Lys 


Tyr 


Ser 


Gly Lys 


Glu 


(j±U 


Thr 


Val 


Gin 


Phe 


Asp 


Val 


Gin Lys 




395 










400 








405 


Leu 


Arg 


He 


Tyr Cys 


Asp 


Pro 


Glu 


Gin 


Asn 


Asn 




Glu 


Thr Ala 






410 










415 








420 


Cys 


Glu 


He 


Pro Gly 


Phe 


Pirn 


Leu Asn Gly 


Pro 


Ser 


Asp 


Val Gly 






425 










430 








435 


Ser 


Thr 


Pro 


Ala Pro 


Cys 


He 


Cys 


Pro 


Pro 


Gly 


Lys 


Pro 


Gly Leu 








440 










445 








450 


Gin 


Gly 


Pro 


Lys Gly Asp 


Pro 


Gly 


Leu 


Pro 


Gly Asn 


Pro 


Gly Tyr 






455 










460 








465 


Pro 


Gly Gin 


Pro Gly Gin Asp Gly Lys 


Pro 


Gly Tyr 


Gin 


Gly He 








470 










475 








480 


Ala 


Gly 


Thr 


Pro Gly Val 


Pro 


Gly 


Ser 


Pro 


Gly 


He 


Gin 


Gly Ala 






485 










490 








4 95 


Arg 


Gly 


Leu 


Pro Gly Tyr 


Lys 


Gly Glu 


Pro Gly Arg Asp 


Gly Asp 




500 










505 








510 


Lys 


Gly Asp Arg Gly 


Leu 


Pro Gly 


Phe 


Pro 


Gly 


Leu 


His 


Gly Met 
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SEQUENCE LISTING 







515 






520 








525 


Pro 


Gly Ser 


Lys Gly Glu 


Met 


Gly Ala 


Lys 


Gly Asp 


Lys 


Gly Ser 




530 






535 








540 


Pro 


Gly Phe 


Tyr Gly Lys Lys Gly Ala Lys Gly Glu Lys Gly Asn 




545 






550 








555 


Ala Gly Phe 


Pro Gly Leu 


Pro 


Gly Pro 


Ala 


Gly 


Glu 


Pro 


Gly Arg 






560 






565 








570 


His 


Gly Lys 


Asp Gly Leu 


Met 


Gly Ser 


Pro Gly 


Phe 


Lys 


Gly Glu 




575 






580 








585 


Ala 


Gly Ser 


Pro Gly Ala Pro Gly Gin Asp Gly Thr Arg Gly Glu 




590 






595 










Pro Gly He 


Pro Gly pne 


Pro 


Gly Asn Arg 


Gly 


Leu 


Met 


Gly Gin 






605 






610 








615 


Lys Gly Glu 


lie Gly Pro 


Pro 


Gly Gin 


Gin 


Gly Lys 


Lys 


Gly Ala 






620 






625 








63 0 


Pro 


Gly Met 


Pro Gly Leu 


weu 


Gly Ser Asn Gly 


Ser 


Pro 


Gly Gin 




635 






640 








645 


Pro 


Gly Thr 


Pro Gly Ser 


Lys 




Lys 


Gly Glu 


Pro 


Gly He 




650 






655 








660 


Gin 


Gly Met 


Pro Gly Ala 


Ser 


Pi T 1 

\j±y J_itrLi 


Lys 


Gly 


pin 


Pro 


Gly Ala 




665 






670 








675 


Thr 


Gly Ser 


Pro Gly Glu 


Pro 


biy lyr 


Met 


Gly 


Leu 


Pro 


Gly He 




680 






685 








690 


Gin 


Gly Lys 


Lys Gly Asp 


Lys 


Gly Asn 


Gin 


Gly 


(j±U 


Lys 


Gly He 




695 






700 








705 


Gin 


Gly Gin 


Lys Gly (jiu 


Asn 


Gly Arg 


Gin 


Gly 


He 


Pro Gly Gin 




710 






715 








720 


Gin 


Gly He 


Gin Gly His 


HIS 


Gly Ala 


Lys 


Gly 


Glu 


Arg 


Gly Glu 




725 






730 








735 


Lys 


Gly Glu 


Pro Gly Val 


Arg 


Gly Ala 


He 


Gly 


Ser 


Lys 


Gly Glu 


740 






745 








750 


Ser 


Gly Val 


Asp Gly Leu 


Met 


Gly Pro 


Ala Gly 


Pro 


Lys 


Gly Gin 




755 






760 








765 


Pro 


Gly Asp 


Pro Gly Pro 


Gin 


Gly Pro 


Pro Gly 


Leu 


Asp 


Gly Lys 




770 






775 








780 


Pro 


Gly Arg 


Glu Phe Ser 


Glu 


Gin Phe 


He 


Arg 


Gin 


Val 


Cys Thr 




785 






790 








795 


Asp 


Val He 


Arg Ala Gin 


Leu 


Pro Val 


Leu 


Leu 


Gin 


Ser 


Gly Arg 




800 






805 








810 


He 


Arg Asn 


Cys Asp His 


Cys 


Leu Ser 


Gin 


His 


Gly 


Ser 


Pro Gly 




815 






820 








825 


He 


Pro Gly 


Pro Pro Gly 


Pro 


He Gly 


Pro 


Glu 


Gly 


Pro 


Arg Gly 




830 






835 








840 


Leu 


Pro Gly Leu Pro Gly Arg Asp Gly Val 


Pro 


Gly 


Leu 


Val Gly 






845 






850 








855 


Val 


Pro Gly Arg Pro Gly Val 


Arg Gly 


Leu 


Lys 


Gly 


Leu 


Pro Gly 






860 






865 








870 


Arg Asn Gly 


Glu Lys Gly 


Ser 


Gin Gly 


Phe 


Gly Tyr 


Pro 


Gly Glu 
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SEQUENCE 


LISTING 














8 /b 








DDfl 
OOU 








885 


Gin Gly 


Pro 


Pro 


Gly 


Pro 


Prn m v 


Pro 


Glu 


Glv 


IT I D 


Drn fll \/ 


Tip 






q q n 
o y u 








o y 








900 


Ser Lys 


Glu 


Gly 


Pro 


Pro 


m v Aqn 


Pro 


Glv 


Leu 


Drn 


\r T ,\/Q 

ijiy j_iy o 
















910 








915 


Gly Asp 


TT -i j— • 

HIS 


(jiy 


Lys 


irlTO 


Gly He 


Gin 


Glv 


Gin 


Pro 


Gly Pro 


Pro 






















930 


Gly He 


Cys 


Asp 


Pro 


Ser 


Leu Cys 


Phe 


Ser 


Val 


J. ± c 










935 








940 








945 


Asp Pro 


Phe 


Arg 


Lys 


Gly 


Pro Asn 


Tyr 
















950 






954 













<210> 2 

<211> 171 

<212> PRT 

<213> Homo sapiens 



<220> 




























<223> 


von 


Willebrand factor A 


domain 












<400> 


2 


























Asp Leu 


Val 


Phe 


He 


Leu 


Asp 


Gly 


Ser 


Tyr 


Ser 


Val 


Gly 


Pro 


Glu 


1 






5 










10 












Asn Phe 


Glu 


He 


Val 


Lys 


Lys 


Trp 


Leu 


Val 


Asn 


He 


Thr 


Lys 


Asn 








20 










25 










30 


Phe Asp 


He 


Gly 


Pro 


Lys 


Phe 


He 


Gin 


Val 


Gly Val 


Val 


Gin 


Tyr 






35 










40 










45 


Ser Asp 


Tyr 


Pro 


Val 


Leu 


Glu 


He 


Pro 


Leu 


Gly 


Ser 


Tyr 


Asp 


Ser 




50 










55 










60 


Gly Glu 


His 


Leu 


Thr 


Ala 


Ala 


Val 


Glu 


Ser 


He 


Leu 


Tyr 


Leu 


Gly 






65 










70 










75 


Gly Asn 


Thr 


Lys 


Thr 


Gly 


Lys 


Ala 


He 


Gin 


Phe 


Ala 


Leu 


Asp 


Tyr 






80 










85 










90 


Leu Phe 


Ala 


Lys 


Ser 


Ser 


Arg 


Phe 


Leu 


Thr 


Lys 


He 


Ala 


Val 


Val 






95 










100 










105 


Leu Thr 


Asp 


Gly 


Lys 


Ser 


Gin 


Asp 


Asp 


Val 


Lys 


Asp 


Ala 


Ala 


Gin 






110 










115 










120 


Ala Ala 


Arg 


Asp 


Ser 


Lys 


He 


Thr 


Leu 


Phe 


Ala 


He 


Gly Val 


Gly 






125 










130 










135 


Ser Glu 


Thr 


Glu 


Asp 


Ala 


Glu 


Leu 


Arg 


Ala 


He 


Ala 


Asn 


Lys 


Pro 








140 










145 










150 


Ser Ser 


Thr 


Tyr 


Val 


Phe 


Tyr 


Val 


Glu 


Asp 


Tyr 


He 


Ala 


He 


Ser 






155 










160 










165 


Lys He 


Arg 


Glu 


Val 


Met 


























170 


171 





















<210> 3 
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SEQUENCE LISTING 

<211> 183 
<212> PRT 
<213> Homo sapiens 

<220> 

<223> Thrombospondin N- terminal -like domain 



<400> 3 



Gly 


Phe 


Asp 


He 


Leu 


Leu 


Gly Leu Asp Val 


Asn 


Lys 


Lys 


Val 


Lys 


X 








c. 
D 










10 










15 


Lys 


Arg 


He 


Gin 


Leu 


Ser 


trJL CJ 


T irQ 

i_iy o 


Lys 


He 
25 


Lys 


Gly 


Tyr 


Glu 


Val 
30 


Thr 


Ser 


Lys 


Val 


Asp 


Leu 


Ocl 


\JJ- u 


Leu 


Thr 
40 


Ser 


Asn 


Val 


Phe 


Pro 
45 


Glu 


Gly 


Leu 


Pro 


Pro 


Ser 


iyr 


V Ct J_ 


Phe 


Val 
55 


Ser 


Thr 


Gin 


Arg 


Phe 
60 


Lys 


Val 


Lys 


Lys 


He 


Trp 


Asp 


Leu 


Trp Arg 


He 


Leu 


Thr 


He 


Asp 










D 3 










70 










75 


Gly 


Arg 


Pro 


Gin 


He 

o n 
o U 


Ala 


val 


Thr 


Leu 


Asn 
85 


Gly 


Val 


Asp 


Lys 


He 

-7 \J 


Leu 


Leu 


Phe 


Thr 


Thr 
95 


Thr 


Ser 


Val 


He 


Asn 
100 


Gly 


Ser 


Gin 


Val 


Val 
105 


Thr 


Phe 


Ala 


Asn 


Pro 
110 


Gin 


Val 


Lys 


Thr 


Leu 
115 


Phe 


Asp 


Glu 


Gly 


Trp 
120 


His 


Gin 


He 


Arg 


Leu 
125 


Leu 


Val 


Thr 


Glu 


Gin 
130 


Asp 


Val 


Thr 


Leu 


Tyr 
135 


He 


Asp 


Asp 


Gin 


Gin 
140 


He 


Glu 


Asn 


Lys 


Pro 
145 


Leu 


His 


Pro 


Val 


Leu 
150 


Gly 


He 


Leu 


He 


Asn 
155 


Gly 


Gin 


Thr 


Gin 


He 
160 


Gly 


Lys 


Tyr 


Ser 


Gly 
165 


Lys 


Glu 


Glu 


Thr 


Val 
170 


Gin 


Phe 


Asp 


Val 


Gin 
175 


Lys 


Leu 


Arg 


He 


Tyr 
180 



Cys Asp Pro 
183 



<210> 4 

<211> 509 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> collagenous domain 

Pro in this sequence stands for hydroxyproline 

<400> 4 

Gly Lys Pro Gly Leu Gin Gly Pro Lys Gly Asp Pro Gly Leu Pro 
1 5 10 15 
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SEQUENCE LISTING 



Gly Asn 


Pro 


Gly Tyr 


Pro Gly Gin 


Pro Gly Gin 


Asp Gly Lys 


Pro 




20 






25 






30 


Gly Tyr Gin Gly He 


Ala Gly Thr 


Pro Gly Val 


Pro Gly Ser 


Pro 






35 






40 






45 


Gly He 


Gin 


Gly Ala Arg Gly Leu 


Pro Gly Tyr 


Lys 


Gly Glu 


Pro 




50 






55 






60 


Gly Arg Asp Gly Asp Lys Gly Asp Arg Gly Leu 


Pro Gly Phe 


Pro 






65 






70 






75 


Gly Leu 


His 


Gly Met 


Pro Gly Ser 


Lys Gly Glu 




Gly Ala 


Lys 




80 






85 






90 


Gly Asp 


Lys 


Gly Ser 


Pro 


Gly Phe 


Tyr Gly Lys 


Lys 


Gly Ala 


Lys 


95 






100 






105 


Gly Glu 


Lys Gly Asn Ala Gly Phe 


Pro Gly Leu 


rlO 


Gly Pro 


Ala 




110 






115 






120 


Gly Glu 


Pro Gly Arg His Gly Lys Asp Gly Leu 


1*1 c t. 


Gly Ser 


Pro 




125 






i "i n 






135 


Gly Phe 


Lys 


Gly Glu 


Ala 


Gly Ser 


Pro Gly Aia 


T") r~\ 

Fro 


Gly Gin Asp 


140 






X % -3 






150 


Gly Thr 


Arg 


Gly Glu 


Pro 


Gly He 


Pro Gly Pne 


rlO 


Gly Asn Arg 


155 












IBS 


Gly Leu 


Met 


Gly Gin Lys 


Gly Glu 


lie Gly pro 


Pro 


uxy ijin 


Pi n 




170 






1 1 c 
I/O 






-LOW 


Gly Lys 


Lys 


Gly Ala 


Pro 


Gly Met 


Pro Gly Leu 


i v ie l 


Hi if C o -y 






185 






i on 






195 


Gly Ser 


Pro Gly Gin 


Pro 


Gly Thr 


Pro Gly Ser 


Lys 


f~~* T « w O r~\ ~v 

ijiy ser 


T iro 

Lyb 




200 






one 






210 


Gly Glu 


Pro 


Gly He 


Gin 


Gly Met 


Pro Gly Ala 


Ser 


Gly Leu 


Lys 




215 






220 






225 


Gly Glu 


Pro 


Gly Ala 


Thr Gly Ser 


Pro Gly Glu 


Pro 


Gly Tyr 


Met 




230 






235 






240 


Gly Leu 


Pro 


Gly He 


Gin 


Gly Lys 


Lys Gly Asp 


Lys 


Gly Asn 


Gin 




245 






250 






255 


Gly Glu 


Lys 


Gly He 


Gin 


Gly Gin 


Lys Gly Glu 


Asn 


Gly Arg 


Gin 


260 






265 






270 


Gly He 


Pro 


Gly Gin 


Gin 


Gly He 


Gin Gly His 


HIS 


Gly Ala 


Lys 




275 






280 






285 


Gly Glu Arg 


Gly Glu 


Lys 


Gly Glu 


Pro Gly Val 


Arg 


Gly Ala 


He 






290 






295 






300 


Gly Ser 


Lys 


Gly Glu 


Ser 


Gly Val 


Asp Gly Leu 


Met 


Gly Pro 


Ala 


305 






310 






315 


Gly Pro 


Lys 


Gly Gin 


Pro 


Gly Asp 


Pro Gly Pro 


Gin 


Gly Pro 


Pro 


320 






325 






330 


Gly Leu 


Asp 


Gly Lys 


Pro 


Gly Arg 


Glu Phe Ser 


Glu 


Gin Phe 


He 




335 






340 






345 


Arg Gin 


Val 


Cys Thr 


Asp 


Val He 


Arg Ala Gin 


Leu 


Pro Val 


Leu 




350 






355 






360 


Leu Gin 


Ser 


Gly Arg 
365 


He 


Arg Asn 


Cys Asp His 
370 


Cys 


Leu Ser 


Gin 
375 
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His 


Gly 


Ser Pro 


Giy 


lie fro 




I: X vj 


D-rn 
I: i. U 


Gly 


Pro 


He 


Glv 


Pro 






380 


















3 90 

~J S \J 


Glu 


Gly 


Pro Arg 


Giy 


Lieu riu 


ui y 


T .01 1 


IT -L w 


Gly Arg 


Asp 


Gly 


Val 




395 








/I Pi Pi 

4 U U 










405 


Pro 


Gly 


Leu Val 


Gly 


val flLO 


o± y 


A y~pt 


IT -L W 


Gly Val 


Ara 


Gly 


Leu 






410 








4X3 










420 


Lys 


Gly 


Leu Pro 


Gly 


Arg asu 


pi v 
laXy 


m n 
ol U 


jjy o 


Gly 


Ser 


Gin 


Glv 


Phe 




425 


















435 


Gly Tyr 


Pro Gly 


GlU 


lain ijiy 


Dy-A 

ri U 






Pro 


Pro 


Glv 


Pro 


Glu 








440 








A A c: 

4 1 D 










450 


Gly 


Pro 


Pro Gly 


He 


ber Liys 


pi n 
IjXU. 


Pl v 

kjX y 


Prn 
I 1 u 


Pro 


Gly 


Asp 


Pro 


Gly 






455 








a n 










465 


Leu 


Pro 


Gly Lys 


Asp 


<aiy ASp 


nib 


ox y 


j_jy o 


Pro 


Gly 


He 


Gin 


Gly 






470 








4/0 










480 


Gin 


Pro 


Gly Pro 


Pro 


Gly He 


Cys 


Asp 


Pro 


Ser 


Leu 


Cys 


Phe 


Ser 






485 








490 










495 


Val 


He 


Ala Arg 


Arg 


Asp Pro 


Phe 


Arg 


Lys 


Gly 


Pro 


Asn 


Tyr 








500 








505 








509 





<210> 5 

<211> 2865 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> alpha 1 chain collagen 

<400> 5 



atggctcact 


atattacatt 


tctctgcatg 


gttttggtgc 


tgcttcttca 


gaattctgtg 


60 


ttagctgaag 


atggggaagt 


aagatcaagt 


tgtcgtactg 


ctccgacaga 


tttagttttc 


120 


atcttagatg 


gctcttatag 


tgttggccca 


gaaaactttg 


aaatagtgaa 


aaagtggctt 


180 


gtcaatatca 


caaaaaactt 


tgacataggg 


ccgaagttta 


ttcaagttgg 


agtggttcaa 


240 


tatagtgact 


accctgtgct 


ggagattcct 


ctcggaagct 


atgattcagg 


agaacatttg 


300 


acggcagcag 


tggaatccat 


actctactta 


ggaggaaaca 


caaagacagg 


gaaggccatc 


360 


cagtttgcgc 


tcgattacct 


ttttgccaag 


tcctcacgat 


ttctgactaa 


gatagcagtg 


420 


gtacttacgg 


atggcaaatc 


ccaagatgac 


gtcaaggatg 


cagctcaagc 


agcaagagat 


480 


agtaagataa 


cattatttgc 


tattggtgtt 


ggttcagaaa 


cagaagatgc 


cgaacttaga 


540 


gctattgcca 


acaagccttc 


gtctacttat 


gtgttttatg 


tggaagacta 


tattgcaata 


600 
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tccaaaataa gggaagtgat gaagcagaaa ctttgtgaag aatctgtctg tccaacacga 660 

attccagtgg cagctcgtga tgaaagggga tttgatattc ttttaggttt agatgtaaat 720 

aaaaaggtta agaaaagaat acagctttca ccaaaaaaga taaaaggata tgaagtaaca 780 

tcaaaagttg atttatcaga actcacaagc aatgttttcc cagaaggtct tcctccatca 840 

tatgtatttg tgtctactca aagatttaaa gtcaagaaaa tttgggattt atggagaata 900 

ttaactattg atggaaggcc acaaatagca gttaccttaa atggtgtgga caaaatctta 960 

ttatttacaa caaccagcgt aattaatggc tcacaagtgg ttacctttgc taaccctcaa 1020 

gttaagacgt tgtttgatga aggctggcac caaattcgtc tcttagtaac agaacaagat 1080 

gtgactttgt atattgatga ccaacaaatt gaaaacaagc ccttacatcc agttttaggg 1140 

atcttgatca atgggcaaac ccaaattgga aaatattctg gaaaagaaga aactgttcag 1200 

tttgatgtcc aaaagttgcg aatctactgt gacccagaac agaacaaccg ggagacagca 1260 

tgtgagattc ctggattttg ccttaatggt cccagtgatg taggttcaac tccagctccc 1320 

tgtatttgtc ctccgggaaa accaggactt caaggcccca aaggtgaccc tggactgcct 13 80 

gggaaccctg gctaccctgg acaacctggt caagatggta agcctggata tcagggaatt 144 0 

gcagggacac caggtgttcc aggatctcca ggaatacaag gagctcgagg actaccaggt 1500 

tacaaaggag aaccagggcg agatggtgac aagggtgatc gtggacttcc tggttttcct 1560 

gggcttcatg gcatgccagg atcaaagggt gaaatgggtg ccaaaggaga caaaggatca 162 0 

cctggatttt atggcaaaaa gggtgcaaaa ggtgaaaagg ggaatgctgg cttccctggc 1680 

ctccctggac ctgctggaga accaggaaga catggaaagg atggattaat gggtagtccc 174 0 

ggtttcaagg gagaagcagg atcccctggt gctccggggc aggatggaac acggggagag 1800 

cctggaatcc caggatttcc tggaaaccga ggattaatgg gccaaaaggg agaaattggg 1860 

cctccaggac agcaaggaaa aaaaggagcc ccagggatgc ctggtttaat gggaagcaat 1920 

ggctcaccag gccagcctgg aacaccggga tctaagggaa gcaaaggtga acctggaatt 1980 

caagggatgc ctggggcttc tgggctcaag ggagaaccag gagcaacggg ttccccagga 2 040 
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SEQUENCE LISTING 








gaaccaggat 


acatgggttt 


2 t~* f-* o (~~\e~\<~i a h t" 


v_- ddy y ddddd 


agggggacaa 


aggaaatcaa 


2100 


ggtgaaaaag 


gtattcaggg 


LCaaaaggyd 


yd.cicictL.yya.ci 


gacagggaat 


tccagggcaa 


2160 


cagggaattc 


aaggccatca 


i-yy Ly L-dctad 


yy ciy c*.y 


gtgaaaaggg 


agaacctggt 


2220 


gtccgaggtg 


ccattggatc 


aaaayyaydd 


fn}- ft n not" nn 

t*<~uyyyy <-yy 


atggcttgat 


ggggcccgca 


2280 


ggtcctaagg 


ggcaacctgg 


yya L.(_cdy y l. 


pp1~ r^nao^ p 


ccccaggttt 


ggatgggaag 


2340 


cccggaagag 


agttttcaga 


dCaaLLLaCL 


L,y dOddy u u u 


gcacagatgt 


aataagagcc 


2400 


cagctaccag 


tcttacttca 


gagtggaaga 


aL L dy ddd i_ u 


gtgatcattg 


cctgtcccaa 


2460 


catggctccc 


cgggtattcc 




rfrrt - pppj^ t" 


gcccagaggg 


tcccagagga 


2520 


ttacctggtt 


tgccaggaag 


ciy ctuyy uyu l. 


ppi- nri 3 . t" t" r5 Cf 


tgggtgtccc 


tggacgtcca 


2580 


ggtgtcagag 


gattaaaagg 


/-*/-< t~~ -a /-I /-"» a (~1C~f a 

ot-L-ciL-L-ciyyct 


^nsaat" on no 

dy duat zpzJ_3«3 


aaaaagggag 


ccaagggttt 


2640 


gggtatcctg 


gagaacaagg 


t~ /"» "K f~* 1~ <TfT i - 

LCi-LttLyyL 


p p ppp^ prrrt - p 


cagagggccc 


tcctggaata 


2700 


agcaaagaag 


gtcctccagg 


agacccaggt 


ctccctggca 


aagatggaga 


ccatggaaaa 


2760 


cctggaatcc 


aagggcaacc 


aggcccccca 


ggcatctgcg 


acccatcact 


atgttttagt 


2820 


gtaattgcca 


gaagagatcc 


gttcagaaaa 


ggaccaaact 


attag 




2865 
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